(54) Video signal encoding apparatus using a block shuffling technique



(57) A video signal encoding apparatus for compressing and encoding a digital video signal comprises block structuring means (41) for structuring the video signal into a matrix array of blocks, transformation means (2) for performing an orthogonal transform on each of the structured blocks, and encoding means (5), wherein unit structuring means (41) structures units each comprising a plurality of blocks, prior to the orthogonal transform by said transformation means, by shuffling the blocks in such a manner that any given shuffling unit and the four shuffling units most adjacent to said given shuffling unit belong to different units.

Description

BACKGROUND OF THE INVENTION 
Field of the Invention 

The present invention relates to a video signal encoding apparatus for compressing and encoding a video signal
by dividing it into blocks and performing an orthogonal transform on each block.

Description of the Prior Art 

If video data converted to digital signals is directly recorded on tape or other recording medium, the volume of data 
will be so great that it will usually exceed the limit of the data amount that the recording medium can hold. Therefore, 
when recording a digital video signal on tape or other recording medium, it is necessary to compress it so that the data 
volume does not exceed the limit. To achieve this, it has been known to compress the video signal by using a high-effi- 
ciency encoding apparatus. 

One example of such high-efficiency encoding that has been widely used is the orthogonal transform encoding method in which transform coefficients obtained by orthogonal-transforming the original signal are quantized for encoding. This method is known to provide high encoding efficiency. In encoding a video signal by this method, the video signal is first divided into blocks each consisting of n x n pixels (where n is an integer), an orthogonal transformation is performed on each block to transform it into a transform coefficient representing n x n frequency regions, and then the transform coefficient is quantized. However, when all blocks are quantized with the same number of bits, adequate image quality can be obtained for the video blocks in flat areas, but noise appears in the video blocks including edge areas since errors are dispersed in the vicinity of the edge areas.

An example of an encoding apparatus that overcomes the above problem is disclosed in Japanese Patent Application
Laid-Open No. 2-105792. Fig. 1 shows a block diagram of the encoding apparatus disclosed in the Patent Publication.
The encoding apparatus shown is described below with reference to Fig. 1. A video signal is inputted to a blocking circuit
51 where it is divided into blocks, each block then being supplied to an orthogonal transforming circuit 52 for orthogonal
transformation. The transform coefficient obtained by the orthogonal transformation is quantized by a quantizing circuit 
53. The quantizing circuit 53 has the ability to perform quantization using a variable number of quantization bits. An 
edge area detecting circuit 54 is provided to detect the edges of the video signal, while a flat area detecting circuit 55 
is provided to determine whether the block represents a flat area. Based on the outputs from the edge area detecting 
circuit 54 and the flat area detecting circuit 55, a block identifying circuit 56 determines whether the block includes an
edge area as well as a flat area, the result of which is fed to the quantizing circuit 53 to determine the number of quantization bits.
When the whole block is flat or when the whole block has a complicated structure, it is decided to use a
smaller bit code for quantization since noise is not appreciably visible. On the other hand, if the block includes an edge
area as well as a flat area, it is decided to use a higher bit code for quantization to prevent the generation of noise in 
the flat area. Thus, in the encoding apparatus disclosed in the above Patent Publication, in order to overcome the afore- 
mentioned problem, the transform coefficients for blocks including both edge and flat areas are quantized using a higher 
bit code to reduce the noise and thereby improve the image quality after decoding. The determining factors used to 
detect the edge or flat areas in a block include a variance within the block, the maximum value of the block, the dynamic 
range of the block, etc. These factors are collectively referred to as the activity index. In the above prior art encoding
apparatus, the number of quantization bits (quantization level) is selected for each block on the basis of the activity 
index. 
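
The block classification described above can be sketched as follows. This is only an illustrative sketch, not the circuit of the cited publication: the 4 x 4 sub-block split, the thresholds `flat_var` and `edge_range`, and the bit counts `low_bits` and `high_bits` are all hypothetical values chosen for the example. The activity index is taken as the variance, maximum value, and dynamic range named in the text.

```python
from statistics import pvariance


def activity_index(block):
    """Return (variance, maximum, dynamic range) of a 2-D block of pixel values."""
    pixels = [p for row in block for p in row]
    return pvariance(pixels), max(pixels), max(pixels) - min(pixels)


def sub_blocks(block, size=4):
    """Yield the size x size sub-blocks of a square block (hypothetical helper)."""
    n = len(block)
    for r in range(0, n, size):
        for c in range(0, n, size):
            yield [row[c:c + size] for row in block[r:r + size]]


def choose_quant_bits(block, flat_var=4.0, edge_range=64, low_bits=4, high_bits=6):
    """Use more bits only when a block mixes a flat part with a strong edge.

    All thresholds and bit counts are assumptions made for illustration.
    """
    _, _, dyn_range = activity_index(block)
    has_flat_part = any(activity_index(sb)[0] < flat_var for sb in sub_blocks(block))
    has_edge = dyn_range >= edge_range
    return high_bits if (has_flat_part and has_edge) else low_bits
```

With these assumed thresholds, a wholly flat block and a wholly busy block both receive the smaller bit count, while a block containing a flat region next to a sharp edge receives the larger one.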

The output of the quantizing circuit 53 of Fig. 1 is encoded, usually using entropy encoding such as Huffman encoding,
into a variable-length code for transmission. The bit length of one block after variable-length encoding varies from
block to block, and in the case of a recording medium such as a helical scan digital video tape recorder (VTR) having a 
fixed track length, it is convenient to grasp the number of data blocks to be recorded per track. Therefore, it is a usual 
practice to predetermine at least the number of data blocks to be recorded per track. Also, when block correcting codes 
(e.g., BCH codes, Reed-Solomon codes, etc.) are employed as error-correcting codes, it may be practiced to fix the
data length of variable-length code for each error-correcting block. Usually, in encoding of video signals, one field or 
frame is divided into N segments (where N is an integer), each segment serving as a unit, and the maximum data 
amount is set for each of the N units. 

However, in a channel, such as a digital VTR, in which the data length for the variable-length codes is fixed, the data length of variable-length code may vary from code to code after variable-length encoding depending on the kind of the image processed, and the total code length after variable-length encoding may exceed the fixed length of the channel, resulting in an overflow. If this happens, the transmission will be cut off because of the data overflow, and therefore, not only the overflowed data but also the subsequent data will not be transmitted. This presents the problem of an inability to correctly perform the decoding of the original signal.

Variable-length encoding of a television image is usually performed in sequence from left to right and from top to bottom of the television screen. Therefore, the problem is that the above-mentioned cutoff is likely to occur in the center of the television screen where the feature elements of the image are contained.

IEEE TRANSACTIONS ON ACOUSTICS AND SPEECH AND SIGNAL PROCESSING, vol. 37, no. 11, November 1989, NY, US, pages 1743-1749, NGAN et al.: "Adaptive Cosine Transform Coding of Images in Perceptual Domain" already discloses a video signal encoding apparatus for compressing and encoding a digital video signal to obtain coded data compressed within a predetermined data amount. This apparatus comprises means for structuring blocks each consisting of a plurality of pixels in the video signal, means for performing an orthogonal transformation on each of the structured blocks to obtain a transform coefficient, means for quantizing the transform coefficient, means for encoding the quantized data to obtain coded data, means for storing the obtained coded data, and means for controlling the quantizing means on the basis of the amount of the coded data stored in the storage means.

PAJP, vol. 12, no. 124 (E 601) shows a picture signal transmission system which obtains a smooth reproducing moving image with improved visual characteristic by dividing both surfaces into plural areas, applying priority to the respective areas and transmitting more information of the area of the higher priority than the information of the area of the lower priority.

PAJP, vol. 12, no. 355 (E 661) describes a picture compressing device which, in order to reduce the number of quantizing errors and to shorten the operating time, comprises a preprocessing means which preprocesses original picture data and a rearranging section which rearranges an orthogonally transformed output in blocks of the same frequency component.

Further, EP-A-0 322 955 refers to a receiver for a high definition television signal in which the signal prior to transmission
is sub-sampled on a block-by-block basis according to the movement. The received sub-sampled signal is
applied to a shuffler which shuffles the pixels of blocks in a manner which is the inverse to that performed prior to trans- 
mission. 

SUMMARY OF THE INVENTION

It is the object of the invention to provide a video signal encoding apparatus capable of fixing the encoded data 
length to a predetermined length wherein distortions resulting from transmission cutoffs are not easily visible even when
the code length of the data to be transmitted is fixed. 

According to a first embodiment of the present invention a video signal encoding apparatus for compressing and encoding a digital video signal containing a chrominance signal comprises block structuring means for structuring the video signal into a matrix array of blocks each consisting of a plurality of pixels, transformation means for performing an orthogonal transform on each of the structured blocks to obtain a transform coefficient, encoding means for encoding the obtained transform coefficient to obtain coded data, and unit structuring means for structuring units each comprising a plurality of blocks, prior to the orthogonal transform by said transformation means, by shuffling the blocks in such a manner that any given shuffling unit and four shuffling units most adjacent to said given shuffling unit belong to different units, the reference of the shuffling unit being the size that the blocks of said chrominance signal occupy on the screen.

According to a second embodiment of the present invention a video signal encoding apparatus for compressing and encoding a digital video signal comprises block structuring means for structuring the video signal into a matrix array of blocks each consisting of a plurality of pixels, transformation means for performing an orthogonal transform on each of the structured blocks to obtain a transform coefficient, encoding means for encoding the obtained transform coefficient to obtain coded data, and unit structuring means for structuring units each comprising a plurality of blocks, prior to the orthogonal transform by said transformation means, in such a manner that any given block and four blocks most adjacent to said given block belong to different units.

According to a third embodiment of the present invention a video signal encoding apparatus for compressing and encoding a digital video signal comprises block structuring means for structuring the video signal into a matrix array of blocks each consisting of a plurality of pixels, transformation means for performing an orthogonal transform on each of the structured blocks to obtain a transform coefficient, encoding means for encoding the obtained transform coefficient to obtain coded data, unit structuring means for structuring units each comprising a plurality of blocks, prior to the orthogonal transform by said transformation means, in such a manner that any given block and four blocks most adjacent to said given block belong to different units, decision means for deciding the order in which the blocks are to be encoded, the order within each unit being such that encoding is performed starting with the blocks nearer to the center of the screen and then proceeding to the blocks nearer to the sides of the screen, and control means for controlling the amount of coded data on a unit-by-unit basis.

The above and further objects and features of the invention will more fully be apparent from the following detailed description with accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 is a diagram showing the configuration of a prior art video signal encoding apparatus.
Fig. 2 is a diagram showing the configuration of a video signal encoding apparatus comprising a variable-length encoding circuit and a buffer memory.
Fig. 3 is a diagram showing an example of encoding during the process.
Fig. 4 is a diagram showing the scanning sequence during encoding.
Fig. 5 is a diagram showing an alternative configuration of the apparatus of Fig. 2.
Fig. 6 is a diagram showing the configuration of a video signal encoding apparatus in accordance with a first embodiment of the invention.
Fig. 7 is a diagram explaining the operation of shuffling in the first embodiment.
Fig. 8 is a diagram explaining the principle of shuffling in the first embodiment.
Fig. 9 is a diagram showing an example of shuffling in the first embodiment.
Fig. 10 is a diagram showing another example of shuffling in the first embodiment.
Fig. 11 is a diagram showing still another example of shuffling in the first embodiment.
Fig. 12 is a diagram showing the configuration of a shuffling circuit in the first embodiment.
Fig. 13 is a diagram showing the configuration of a video signal encoding apparatus in accordance with a second embodiment of the invention.
Fig. 14 is a diagram showing an alternative configuration of the second embodiment.
Fig. 15 is a diagram showing an example of shuffling in the second embodiment.
Fig. 16 is a diagram showing another example of shuffling in the second embodiment.
Fig. 17 is a diagram showing still another example of shuffling in the second embodiment.
Fig. 18 is a diagram showing yet another example of shuffling in the second embodiment.
Fig. 19 is a diagram showing a further example of shuffling in the second embodiment.
Fig. 20 is a diagram showing a still further example of shuffling in the second embodiment.

In Fig. 2, the reference numeral 1 indicates a blocking circuit for dividing the input digital video signal into blocks each consisting of a plurality of pixels. Each block is fed from the blocking circuit 1 to a DCT circuit 2. The DCT circuit 2 performs a discrete cosine transform (DCT) on each block and supplies the obtained transform coefficient (DCT coefficient) to a weighting circuit 3. The weighting circuit 3 weights each DCT coefficient and supplies the weighted DCT coefficient to a quantizing circuit 4. The quantizing circuit 4 quantizes the weighted DCT coefficient with the number of quantization bits determined by a controller 8, and supplies the quantized DCT coefficient to a variable-length encoding circuit 5 through a switch 7. The variable-length encoding circuit 5 encodes the quantized DCT coefficient into a variable-length code and transfers the variable-length encoded data to a buffer memory 6. The buffer memory 6 is constructed from a RAM or the like and has the storage capacity equivalent to the data length of one track. The switch 7 turns on and off the data input to the variable-length encoding circuit 5. The controller 8 controls the number of quantization bits for the quantizing circuit 4, as well as the switching operation of the switch 7, on the basis of the amount of data stored in the buffer memory 6.
The operation will now be described.

The data obtained by sampling the video signal is divided by the blocking circuit 1 into blocks each consisting of, for example, eight pixels in both horizontal and vertical directions. The DCT circuit 2 performs a DCT on each block, and the obtained DCT coefficient is then weighted by the weighting circuit 3. At this time, the weighting is performed so that weighting factors for DCT coefficients in higher frequency regions will be smaller values. This is because the visual resolution drops for higher frequency regions, allowing high-efficiency encoding without noticeable degradation. Next, the weighted DCT coefficient is quantized by the quantizing circuit 4. Quantized n-bit data may be expressed as shown in Fig. 3, for example. This data is encoded by the variable-length encoding circuit 5 into a variable-length code by performing one-dimensional scanning as shown in Fig. 4. The variable-length encoding circuit 5 is a circuit for encoding data into a code whose length depends, for example, on the string of zeros (zero run length) and nonzero value, and usually, the Huffman encoding and like methods are widely used. The output of the variable-length encoding circuit 5 is stored in the buffer memory 6 for transfer to the transmission channel.
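
The chain of DCT, weighting, quantization, one-dimensional scanning, and zero-run pairing can be illustrated with the following minimal sketch. Several details are assumptions and are not taken from the specification: the weighting law `1 / (1 + 0.1 * (u + v))`, the quantization step of 4, and the use of a zigzag order for the one-dimensional scan (Fig. 4 is not reproduced here). The Huffman stage that would assign codes to the (zero run, value) pairs is omitted.

```python
import math

N = 8  # block size in pixels


def dct_2d(block):
    """Orthonormal 2-D DCT-II of an N x N block (naive form, for clarity only)."""
    def c(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    out = [[0.0] * N for _ in range(N)]
    for v in range(N):
        for u in range(N):
            s = 0.0
            for y in range(N):
                for x in range(N):
                    s += (block[y][x]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[v][u] = c(u) * c(v) * s
    return out


def weight_and_quantize(coeffs, step=4.0):
    """Scale high-frequency coefficients down, then quantize (assumed law and step)."""
    return [[int(round(coeffs[v][u] / (1.0 + 0.1 * (u + v)) / step))
             for u in range(N)] for v in range(N)]


def zigzag(q):
    """Scan a 2-D array along anti-diagonals, alternating direction (assumed order)."""
    order = sorted(((v, u) for v in range(N) for u in range(N)),
                   key=lambda p: (p[0] + p[1], p[0] if (p[0] + p[1]) % 2 else p[1]))
    return [q[v][u] for v, u in order]


def run_length_pairs(seq):
    """Convert a scanned sequence into (zero_run, nonzero_value) pairs."""
    pairs, run = [], 0
    for value in seq:
        if value == 0:
            run += 1
        else:
            pairs.append((run, value))
            run = 0
    return pairs  # trailing zeros are dropped; an end-of-block code would follow
```

For a completely flat block only the DC coefficient survives quantization, so the whole block collapses to a single pair, which is why flat areas cost few bits after variable-length encoding.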

However, the length of the variable-length code outputted from the variable-length encoding circuit 5 varies according to the image pattern and, depending on the situation, may exceed or may not reach the maximum transmissible code length. The controller 8 predicts an occurrence of excess data by comparing the address value being written in the buffer memory 6 with the limit data length, and outputs signals to control the number of quantization bits for the quantizing circuit 4 and the switching operation of the switch 7.

Therefore, even if the data volume instantaneously increases at a particular portion of the image on the television 
screen, the buffer memory 6 can provide a sufficient capacity for data storage, and there arises no situation that results 
in an overflow or that causes the controller 8 to direct transmission cutoff. 
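
The behaviour of the controller 8 can be pictured with the toy model below. It is a sketch under stated assumptions, not the circuit itself: the code length of a block is modelled as a hypothetical per-bit cost multiplied by the number of quantization bits, and the warning ratio, bit limits, and function name `control_unit` are invented for illustration.

```python
def control_unit(block_costs, limit, max_bits=6, min_bits=3, warn_ratio=0.75):
    """Simulate buffer-based control of quantization bits and the input switch.

    block_costs: hypothetical cost per quantization bit for each block.
    limit:       maximum amount of data the buffer (one track) can hold.
    Returns the amount of data written and a log of (bits, length) per block,
    where a length of 0 means the switch was turned off for that block.
    """
    used = 0
    bits = max_bits
    log = []
    for cost in block_costs:
        # Predict excess data from the current write position in the buffer.
        if used > warn_ratio * limit and bits > min_bits:
            bits -= 1
        length = cost * bits
        if used + length > limit:
            log.append((bits, 0))  # switch off: the block is not encoded
            continue
        used += length
        log.append((bits, length))
    return used, log
```

In this model the written amount never exceeds the limit; quality is traded away near the end of the unit instead, which is the situation the later shuffling formats are designed to push toward the sides of the screen.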

Fig. 5 is a block diagram showing an alternative configuration. In this alternative configuration, the controller 8 controls only the switching operation of the switch 7.

The preferred embodiments of the present invention will now be described with reference to the accompanying
drawings. 

(Embodiment 1)

When a plurality of blocks is grouped into a unit, it has been a usual practice to perform encoding unit by unit starting from a particular position on the screen (e.g. from the upper left of the screen). Therefore, the code amount varies largely from unit to unit, and there arises the problem that the transmission efficiency decreases when the upper limit of data amount is set to match the units having a larger code amount. The first embodiment and the subsequent second embodiment of the invention are provided aiming at overcoming such a problem.

Fig. 6 is a block diagram showing the configuration of a video signal encoding apparatus in accordance with the first embodiment of the invention. In Fig. 6, the reference numerals 2, 3, 4, and 5 designate a DCT circuit, a weighting circuit, a quantizing circuit, and a variable-length encoding circuit, respectively. These circuits are identical to those shown in Fig. 5. At the front stage of the DCT circuit 2, there is provided a blocking/shuffling circuit 41 for dividing a digital video signal into blocks of a plurality of pixels and shuffling the thus obtained blocks. The block data is supplied from the blocking/shuffling circuit 41 to the DCT circuit 2. The quantizing circuit 4 quantizes the weighted DCT coefficient with the number of quantization bits decided by a quantization bit number deciding circuit 43 and supplies the quantized DCT coefficient to the variable-length encoding circuit 5. The variable-length encoding circuit 5 encodes the quantized DCT coefficient into a variable-length code and supplies the variable-length code data to a buffer memory 42.
The operation will now be described.

A digital video signal is inputted in scanning line sequence to the blocking/shuffling circuit 41 where the signal is divided into blocks of n x n pixels within one field or one frame and then shuffled in accordance, for example, with the shuffling format shown in Fig. 7. One block in Fig. 7 corresponds to one DCT block and the outer frame corresponds to that of the television screen. When the luminance signal conforming to the NTSC system is sampled at a rate of 13.5 MHz, for example, the effective scanning area per frame covers 720 pixels in the horizontal direction and 486 pixels in the vertical direction. When one frame is divided into blocks of 8 x 8 pixels, for example, there remain six pixels in the vertical direction; therefore, it is supposed here to encode the picture signal for 720 x 480 pixels, discarding the data for the three horizontal scanning lines from the top and bottom of the screen. Since the video signal is divided into blocks of 8 x 8 pixels, this means 90 x 60 blocks, i.e., 5,400 blocks in total. That is, when the block address in the horizontal direction within one frame is denoted as i and that in the vertical direction as j, i is expressed as 1 ≤ i ≤ 90 and j as 1 ≤ j ≤ 60.

Furthermore, the 5,400 blocks are grouped into N units. In Fig. 7, N = 5, and the alphabetic characters in A1, B1, 
etc. assigned to each block indicate the names of the units. Since N = 5, there are five unit names A to E. The numeric 
parts in A1, B1, etc. are numbers indicating the encoding sequence within each unit.

In Fig. 7, encoding is performed, as a general rule, from left to right and from top to bottom of the screen. In the example shown, since there are 90 blocks in the horizontal direction, the second line from top in Fig. 7 begins with the number 19 which is given by dividing 90 by N (= 5) and adding 1 to the quotient. Therefore, the block address (i, j) for the kth encoding in the uth unit can be expressed by the following equation (1) (provided that (1, 1) indicates the top left corner of the screen and (90, 60) the bottom right corner).



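$$
i = N \times \mathrm{mod}\!\left(k-1,\ \frac{90}{N}\right) + \mathrm{mod}\!\left(u + \left[\frac{(k-1) \times N}{90}\right] - 1,\ N\right) + 1
$$

$$
j = \left[\frac{(k-1) \times N}{90}\right] + 1 \qquad (1)
$$

[a]: Largest integer not exceeding a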



For example, when u = 2 and k = 20, the block address is given by:

$$
i = 5 \times \mathrm{mod}(20-1,\ 18) + \mathrm{mod}\!\left(2 + \left[\frac{19 \times 5}{90}\right] - 1,\ 5\right) + 1 = 5 \times 1 + \mathrm{mod}(2,\ 5) + 1 = 5 + 2 + 1 = 8
$$

$$
j = \left[\frac{19 \times 5}{90}\right] + 1 = 2
$$

Thus, the block address (8, 2) is obtained. Also, u = 2 indicates the unit name is B, and in Fig. 7, the address (8, 2) designates the block B20. Likewise, the address of the block C57, for example, can be found as follows:

$$
i = 5 \times \mathrm{mod}(57-1,\ 18) + \mathrm{mod}\!\left(3 + \left[\frac{56 \times 5}{90}\right] - 1,\ 5\right) + 1 = 5 \times 2 + \mathrm{mod}(3 + 3 - 1,\ 5) + 1 = 10 + 0 + 1 = 11
$$

$$
j = 4
$$

which gives the address (11, 4). That is, Fig. 7 shows the arrangement of blocks after performing shuffling as expressed by the equation (1).
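
Equation (1) can be checked numerically with a short sketch such as the one below. The function name `block_address` and its parameter names are chosen here for illustration; the arithmetic follows the two worked examples above (B20 and C57), with integer division standing in for the bracket [a].

```python
def block_address(u, k, n_units=5, blocks_per_row=90):
    """Return the block address (i, j) of the k-th block of the u-th unit.

    u and k are 1-based, as in the text; (1, 1) is the top left block.
    """
    row = (k - 1) * n_units // blocks_per_row          # [(k - 1) x N / 90]
    i = (n_units * ((k - 1) % (blocks_per_row // n_units))
         + (u + row - 1) % n_units + 1)
    j = row + 1
    return i, j


if __name__ == "__main__":
    print(block_address(2, 20))  # block B20
    print(block_address(3, 57))  # block C57
```

Enumerating all five units with k running from 1 to 1,080 visits each of the 5,400 block addresses exactly once, so the mapping is a true rearrangement of the frame.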

After the above shuffling, each block is sequentially fed to the DCT circuit 2 for a DCT transform and is then weighted by the weighting circuit 3. The quantization bit number deciding circuit 43 calculates the activity index of each block, on the basis of which the number of quantization bits is decided for the block, the information being fed to the quantizing circuit 4. The weighted DCT coefficient is quantized by the quantizing circuit 4 using the number of quantization bits thus decided, and the quantized data is then encoded by the variable-length encoding circuit 5 using such methods as Huffman encoding, the encoded data being transferred to the buffer memory 42 for storage therein.
With the above shuffling, the patterns represented by the blocks to be coded are randomly dispersed, and therefore, the code length is equalized between the units when the number of blocks is greater than a certain degree. According to the simulation conducted by the inventor, it has been found that when the units are assigned by shuffling as shown in Fig. 7, the dispersion value indicating the dispersion of the code amount is reduced to 1/5 to 1/10, compared to when a particular position on the screen is grouped together in a unit without shuffling.

Next, the features of this shuffling will be considered. When considering the effects that the shuffling has on the code amount, the point is to avoid concentrating the blocks of the same pattern in the same unit, which leads to the following point when considered in conjunction with pixels. Blocks neighboring an attention block often have similar patterns; therefore, processing is performed to assign neighboring blocks to different units. This processing is described below using the concept of neighborhood.

Each of the nine squares in Fig. 8 represents a DCT block. There are eight blocks (A to H in Fig. 8) that neighbor an attention block. These blocks are referred to as the eight neighboring blocks, of which the four blocks A, B, C, and D that are most adjacent to the attention block are called the four neighboring blocks. Referring back to Fig. 7, it can be seen that when attention is given to a given block, none of its four neighboring blocks belong to the same unit as the attention block. Of its eight neighboring blocks, there are only two blocks that belong to the same unit. The four neighboring blocks that are spatially most adjacent to the attention block are thus made to belong to different units in order to prevent similar patterns from being concentrated in one unit. This serves to equalize the code amount.
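
The neighborhood property can be verified by brute force. The sketch below rebuilds the unit assignment of equation (1) for N = 5 on the 90 x 60 block grid and counts, for any block, how many of its neighbors share its unit. The helper names `unit_map` and `same_unit_neighbors` are hypothetical and exist only for this illustration.

```python
FOUR = [(-1, 0), (1, 0), (0, -1), (0, 1)]      # most adjacent blocks
DIAG = [(-1, -1), (1, -1), (-1, 1), (1, 1)]    # remaining eight-neighborhood


def unit_map(n_units=5, cols=90, rows=60):
    """Map every block address (i, j) to its unit number using equation (1)."""
    per_unit = cols * rows // n_units
    grid = {}
    for u in range(1, n_units + 1):
        for k in range(1, per_unit + 1):
            row = (k - 1) * n_units // cols
            i = n_units * ((k - 1) % (cols // n_units)) + (u + row - 1) % n_units + 1
            grid[(i, row + 1)] = u
    return grid


def same_unit_neighbors(grid, pos, offsets):
    """Count neighbors of pos (at the given offsets) that lie in the same unit."""
    i, j = pos
    return sum(1 for di, dj in offsets if grid.get((i + di, j + dj)) == grid[pos])
```

Running the four-neighbor count over the whole grid yields zero everywhere, and an interior block has exactly two same-unit blocks among its eight neighbors, both on one diagonal.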

This effect can be achieved not only by the equation (1) but by many other methods. Figs. 9 to 11 show only a few examples of the many methods. In the examples of shuffling shown in Figs. 9 to 11, there are no four neighboring blocks that belong to the same unit. The block address (i, j) in Fig. 9 is expressed by the following equation.
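$$
i = \left[\frac{\mathrm{mod}(k-1,\ 90)}{N}\right] \times N + \left[\frac{N}{2}\right] + 1 + (-1)^{\mathrm{mod}(k,\ N)} \times \left[\frac{\mathrm{mod}(k,\ N)}{2}\right]
$$

$$
j = \mathrm{mod}(\mathrm{mod}(k-1,\ 90) + u - 1,\ N) + 1 + N \times \left[\frac{k-1}{90}\right]
$$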



For example, to find the address of the block D98, since u = 4 and k = 98,

$$
i = \left[\frac{\mathrm{mod}(97,\ 90)}{5}\right] \times 5 + 3 + (-1)^{3} \times \left[\frac{\mathrm{mod}(98,\ 5)}{2}\right] = 5 + 3 - 1 = 7
$$

$$
j = \mathrm{mod}(7 + 4 - 1,\ 5) + 1 + 5 \times 1 = 0 + 1 + 5 = 6
$$

which gives the address (7, 6). The block address (i, j) in Fig. 10 is expressed by the following equation.

$$
i = \mathrm{mod}(\mathrm{mod}(k-1,\ 90) + u,\ N) + \left[\frac{\mathrm{mod}(k-1,\ 90)}{N}\right] \times N
$$



For example, to find the address of the block E102, since u = 5 and k = 102,

$$
i = \mathrm{mod}(11 + 5,\ 5) + \left[\frac{11}{5}\right] \times 5 = 1 + 10 = 11
$$

$$
j = 1 \times 5 + 3 + (-1) \times \left[\frac{\mathrm{mod}(11,\ 5) + 1}{2}\right] = 5 + 3 - 1 = 7
$$

which gives the address (11, 7). Likewise, there exists an equation that realizes the shuffling shown in Fig. 11, along with various other equations that achieve various other shuffling formats.

The circuit that performs the above shuffling operations can be implemented in the configuration shown in Fig. 12. In the figure, the reference numeral 46 designates a block address calculating circuit for calculating the block horizontal address (i) and the block vertical address (j) using the above given equations, and the block address obtained by the block address calculating circuit 46 is supplied to a write/read address generating circuit 45. On the basis of the supplied block address, the write/read address generating circuit 45 outputs a write/read address to a RAM 44. In the RAM 44, each block is arranged according to the address, thus achieving the shuffling as shown in Figs. 7, 9, 10, and 11.
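
In software terms, the arrangement of Fig. 12 amounts to writing the blocks into a memory in raster order and reading them back at addresses computed from (u, k). The following sketch models that idea only; the list standing in for the RAM 44, the function names, and the choice of equation (1) as the address rule are assumptions made for the example.

```python
def address_eq1(u, k, n_units=5, cols=90):
    """Block address (i, j) from equation (1); stands in for circuit 46."""
    row = (k - 1) * n_units // cols
    i = n_units * ((k - 1) % (cols // n_units)) + (u + row - 1) % n_units + 1
    return i, row + 1


def shuffle_blocks(blocks, address_fn, n_units=5, cols=90):
    """Read raster-ordered blocks out unit by unit in encoding order.

    blocks: list of blocks stored row by row (the model of the RAM contents).
    Returns the blocks ordered as unit 1 (k = 1, 2, ...), then unit 2, and so on.
    """
    per_unit = len(blocks) // n_units
    out = []
    for u in range(1, n_units + 1):
        for k in range(1, per_unit + 1):
            i, j = address_fn(u, k)
            out.append(blocks[(j - 1) * cols + (i - 1)])  # read address
    return out
```

Swapping `address_eq1` for another address rule changes the shuffling format without touching the read-out loop, which mirrors how the block address calculating circuit is separated from the address generating circuit.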



(Embodiment 2) 


With the above shuffling, the code amount is substantially equalized between the units within one field or one frame, but in the case of a time-varying image, the image pattern may completely change after several seconds, causing the code amount of each unit within one field or one frame to increase or decrease. If the code amount increases, in each unit it may exceed the maximum transmissible data amount. This presents a serious problem, particularly in the case of a helical scan VTR, since, as previously described, each track is divided into lengths so that each length is the result of dividing the track length by an integer, each limited fixed amount being assigned to codes for a fixed number of blocks. The second embodiment of the invention is devised to overcome this problem. The following describes the second embodiment.

Figs. 13 and 14 are block diagrams each showing the configuration of a video signal encoding apparatus in accordance with the second embodiment. In Fig. 13, the quantizing circuit 4 and the variable-length encoding circuit 5 are controlled using the information on the memory usage in the buffer memory 42. In Fig. 14, the quantization bit number deciding circuit 43 and the variable-length encoding circuit 5 are controlled using the information on the memory usage in the buffer memory 42.

The buffer memory 42 has the capacity to store data of the volume that matches the limit code amount. When the buffer memory 42 nears its full capacity, the probability increases that codes will be generated that may exceed the transmissible limit, and therefore, control is performed to reduce the number of quantization bits, cut off the variable-length encoding, etc. However, such control only succeeds in reducing the code amount by sacrificing the image quality after decoding. As a result of the aforementioned shuffling, there is a possibility, for example, that such control may be effected while processing the center portion of the screen. Since the probability is high that such control is performed on the blocks near the end of each unit, the blocking/shuffling circuit 41 operates in such a manner that the blocks entered near the end of each unit are positioned at the edge of the screen. An example of such shuffling is shown in Fig. 15.

In Fig. 15, it can be seen that blocks with smaller numbers are clustered nearer to the center of the screen, while blocks with larger numbers are clustered on both sides of the screen. Furthermore, in Fig. 15, which shows the shuffling with N = 5, no attention block belongs to the same unit as any of its four neighboring blocks. When the block address of the kth block in the uth unit is denoted as (i, j), the shuffling shown in Fig. 15 is expressed by the following equation.



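$$
i = 45 - (-1)^{\left[\frac{k-1}{60/N}\right]} \times \left[\frac{\left[\frac{k-1}{60/N}\right] + 1}{2}\right]
$$

$$
j = \left(\left[\frac{60 + N}{2N}\right] - 1\right) \times N + 1 - (-1)^{\mathrm{mod}(k-1,\ 60/N)} \times N \times \left[\frac{\mathrm{mod}(k-1,\ 60/N) + 1}{2}\right] + \mathrm{mod}\!\left(\left[\frac{k-1}{60/N}\right] + u - 1,\ N\right)
$$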

For example, to find the address of the block C134, since u = 3 and k = 134,

$$
i = 45 - (-1)^{11} \times [6] = 45 + 6 = 51
$$

$$
j = \left(\left[\frac{65}{10}\right] - 1\right) \times 5 + 1 - (-1)^{1} \times 5 \times 1 + \mathrm{mod}(11 + 3 - 1,\ 5) = 26 + 5 + 3 = 34
$$

which gives the address (51, 34).
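
A center-outward shuffling of this kind can be sketched as below. The formula is a reconstruction that reproduces the worked example for C134 (address (51, 34)); it should be read as an assumption about the Fig. 15 format rather than a confirmed transcription, and the names `fig15_address`, `p`, and `q` are introduced here for illustration.

```python
def fig15_address(u, k, n_units=5, cols=90, rows=60):
    """Assumed center-outward address rule consistent with the C134 example."""
    per_col = rows // n_units            # blocks one unit contributes per column
    p = (k - 1) // per_col               # column counter: grows with k
    q = (k - 1) % per_col                # row-group counter inside the column
    # Columns alternate right and left of the center column as p grows.
    i = cols // 2 - (-1) ** p * ((p + 1) // 2)
    # Row groups alternate below and above the central group as q grows.
    base = ((rows + n_units) // (2 * n_units) - 1) * n_units + 1
    j = base - (-1) ** q * n_units * ((q + 1) // 2) + (p + u - 1) % n_units
    return i, j


if __name__ == "__main__":
    print(fig15_address(3, 134))   # block C134
    print(fig15_address(1, 1))     # first block of unit A, near the center
    print(fig15_address(1, 1080))  # last block of unit A, at a side of the screen
```

Under this rule the first blocks of every unit sit near the middle of the screen and the last blocks land in the outermost columns, so any quality reduction applied late in a unit falls at the sides.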

There are many examples of such shuffling, other than that described above, some of which are shown in Figs. 16 to 20. In Figs. 18 to 20, the block vertical address starts from the top of the screen; it has been confirmed by simulation that distortion is likely to occur on both sides of the screen, as in the case of Fig. 15. The shuffling of Fig. 19 is similar to that of Fig. 18, except that N = 10, providing 10 unit names A to K (I is not used as it can be confused with 1). Also, the shuffling of Fig. 20 is similar to that of Fig. 18, except that N = 3. The shuffling such as shown in Fig. 18 is expressed by the following equation.

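$$
i = 45 - (-1)^{\left[\frac{k-1}{60/N}\right]} \times \left[\frac{\left[\frac{k-1}{60/N}\right] + 1}{2}\right]
$$

$$
j = N \times \mathrm{mod}\!\left(k-1,\ \frac{60}{N}\right) + 1 + \mathrm{mod}\!\left(\left[\frac{k-1}{60/N}\right] + u - 1,\ N\right)
$$

For example, in Fig. 18, to find the address of the block E147, since u = 5 and k = 147,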

$$
i = 45 - (-1)^{12} \times \left[\frac{12 + 1}{2}\right] = 45 - 6 = 39
$$

$$
j = 5 \times 2 + 1 + \mathrm{mod}(12 + 5 - 1,\ 5) = 11 + 1 = 12
$$

which gives the address (39, 12).
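
The Fig. 18 format can be exercised with the sketch below, which reproduces the E147 example and counts how often two four-neighboring blocks fall into the same unit for a given N. The function names are hypothetical, and the formula is the one reconstructed from the worked example, so it should be treated as an assumption.

```python
def fig18_address(u, k, n_units=5, cols=90, rows=60):
    """Address rule consistent with the E147 example (rows start at the top)."""
    per_col = rows // n_units
    p = (k - 1) // per_col                       # column counter, grows with k
    i = cols // 2 - (-1) ** p * ((p + 1) // 2)   # alternate left/right of center
    j = n_units * ((k - 1) % per_col) + 1 + (p + u - 1) % n_units
    return i, j


def four_neighbor_clashes(n_units, cols=90, rows=60):
    """Count adjacent block pairs (horizontal or vertical) sharing a unit."""
    per_unit = cols * rows // n_units
    grid = {fig18_address(u, k, n_units, cols, rows): u
            for u in range(1, n_units + 1)
            for k in range(1, per_unit + 1)}
    clashes = 0
    for (i, j), u in grid.items():
        for di, dj in ((1, 0), (0, 1)):          # each pair is counted once
            if grid.get((i + di, j + dj)) == u:
                clashes += 1
    return clashes
```

With N = 5 or N = 3 the count is zero, whereas N = 2 produces clashes between horizontally adjacent columns, because columns two steps apart in the center-outward order end up side by side on the screen.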

With the above-described shuffling, distortion resulting from the code amount control is driven to both sides of the screen. In the above equation, when N = 2, the possibility may arise that some of four neighboring blocks belong to the same unit, but this is the problem that arises as a result of the shuffling so performed as to position the blocks nearer to the sides of the screen as k increases. Since such blocks appear only on limited portions of the screen, they do not substantially affect the distribution of the code amount. In an ordinary digital VTR, this does not present any problems in actual use since N is usually set at 3 or a greater number considering special replay modes, etc. In any of the above figures illustrating the shuffling, the operation is based on divisions with N as the modulus, but it will be appreciated that similar effects can be obtained when the divisions are performed by taking an integral multiple of N or a quotient of an integer by an integer of N as the modulus. For example, when N = 10, usually 10 is taken as the modulus, but either 20 or 5 (10 x (1/2)) may be taken as the modulus.

In the above first and second embodiments, shuffling is performed in units of blocks, but alternatively, shuffling may 
be performed by, for example, grouping (t x s) blocks into one unit. 

In the above embodiments, the orthogonal transform has been described by taking the DCT as an example, but it will
be appreciated that orthogonal transforms other than the DCT, such as the Hadamard transform, the K-L transform, etc., may
also be used. Also, the weighting circuit 3 may be omitted in a configuration in which the quantization width of the
quantizing circuit 4 is made to vary depending on the frequency.

Claims

1. A video signal encoding apparatus for compressing and encoding a digital video signal containing a chrominance signal,
comprising:

block structuring means for structuring the video signal into a matrix array of blocks each consisting of a
plurality of pixels;

transformation means for performing an orthogonal transform on each of the structured blocks to obtain a
transform coefficient;

encoding means for encoding the obtained transform coefficient to obtain coded data; and

unit structuring means for structuring units each comprising a plurality of blocks, prior to the orthogonal
transform by said transformation means, by shuffling the blocks in such a manner that any given shuffling
unit and four shuffling units most adjacent to said given shuffling unit belong to different units, the reference
of the shuffling unit being the size that the blocks of said chrominance signal occupy on the screen.

2. A video signal encoding apparatus for compressing and encoding a digital video signal, comprising:

block structuring means for structuring the video signal into a matrix array of blocks each consisting of a
plurality of pixels;

transformation means for performing an orthogonal transform on each of the structured blocks to obtain a
transform coefficient;

encoding means for encoding the obtained transform coefficient to obtain coded data; and

unit structuring means for structuring units each comprising a plurality of blocks, prior to the orthogonal
transform by said transformation means, in such a manner that any given block and four blocks most adjacent to
said given block belong to different units.

3. A video signal encoding apparatus for compressing and encoding a digital video signal, comprising:

block structuring means for structuring the video signal into a matrix array of blocks each consisting of a
plurality of pixels;

transformation means for performing an orthogonal transform on each of the structured blocks to obtain a
transform coefficient;

encoding means for encoding the obtained transform coefficient to obtain coded data;

unit structuring means for structuring units each comprising a plurality of blocks, prior to the orthogonal
transform by said transformation means, in such a manner that any given block and four blocks most adjacent to
said given block belong to different units;

decision means for deciding the order in which the blocks are to be encoded, the order within each unit being
such that encoding is performed starting with the blocks nearer to the center of the screen and then proceeding
to the blocks nearer to the sides of the screen; and

control means for controlling the amount of coded data on a unit-by-unit basis.
